Leveraging range joins for the computation of overlap joins
نویسندگان
چکیده
Abstract Joins are essential and potentially expensive operations in database management systems. When data is associated with time periods, joins commonly include predicates that require pairs of argument tuples to overlap order qualify for the result. Our goal enable built-in systems support such joins. In particular, we present an approach where formulated as unions range joins, which more general purpose compared i.e., useful their own right, supported well by B+-trees. The sufficiently flexible it also supports additional equality predicates, open, closed, half-open periods over discrete continuous domains, thus offering both generality simplicity, important a system setting. We provide stand-alone solution performs on par state-of-the-art DBMS embedded able exploit standard indexing clearly outperforms existing solutions depend specialized techniques. offer analytical empirical evaluations proposals. study includes comparisons pertinent proposals offers detailed insight into performance characteristics
منابع مشابه
Overlap Set Similarity Joins with Theoretical Guarantees
This paper studies the set similarity join problem with overlap constraints which, given two collections of sets and a constant c , �nds all the set pairs in the datasets that share at least c common elements. This is a fundamental operation in many �elds, such as information retrieval, data mining, and machine learning. The time complexity of all existing methods is O (n2) where n is the total...
متن کاملFaster Joins , Self - Joins and
We propose a new algorithm, called Stripe-join, for performing a join given a join index. Stripe-join is inspired by an algorithm called \Jive-join" developed by Li and Ross. Stripe-join makes a single sequential pass through each input relation, in addition to one pass through the join index and two passes through a set of temporary les that contain tuple identiiers but no input tuples. Stripe...
متن کاملIndexing Schemes for Efficient Aggregate Computation over Structural Joins
With the increasing popularity of XML as a standard for data representation and exchange, efficient XML query processing has become a necessity. One popular approach encodes the hierarchical structure of XML data through a node numbering scheme, thus reducing typical queries to special forms (structural, path, twig) of containment joins. In this paper we consider how using an index can facilita...
متن کاملFaster Joins, Self Joins and Multi-Way Joins Using Join Indices
We propose a new algorithm called Stripe join for performing a join given a join index Stripe join is inspired by an algorithm called Jive join developed by Li and Ross Stripe join makes a single sequential pass through each input relation in addition to one pass through the join index and two passes through a set of temporary les that contain tuple identi ers but no input tuples Stripe join pe...
متن کاملInspector Joins
The key idea behind Inspector Joins is that during the I/O partitioning phase of a hash-based join, we have the opportunity to look at the actual data itself and then use this knowledge in two ways: (1) to create specialized indexes, specific to the given query on the given data, for optimizing the CPU cache performance of the subsequent join phase of the algorithm, and (2) to decide which join...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Vldb Journal
سال: 2021
ISSN: ['0949-877X', '1066-8888']
DOI: https://doi.org/10.1007/s00778-021-00692-3